Abstract

Motivated by a derandomization of Markov chain Monte Carlo (MCMC), this paper investigates deterministic random walks, which are deterministic processes analogous to random walks. While there has been considerable progress on the analysis of the vertex-wise discrepancy (i.e., $L_\infty$ discrepancy), little is known about the total variation discrepancy (i.e., $L_1$ discrepancy), which plays a significant role in the analysis of an FPRAS based on MCMC. This paper investigates upper bounds of the $L_1$ discrepancy between the expected number of tokens in a Markov chain and the number of tokens in its corresponding deterministic random walk. First, we give a simple but nontrivial upper bound $O(mt^*)$ of the $L_1$ discrepancy for any ergodic Markov chain, where $m$ is the number of edges of the transition diagram and $t^*$ is the mixing time of the Markov chain. Then, we give a better upper bound $O(m\sqrt{t^* \log t^*})$ for non-oblivious deterministic random walks, if the corresponding Markov chain is ergodic and lazy. We also present some lower bounds.

Key words: Rotor router model, Propp machine, load balancing, Markov chain Monte Carlo (MCMC), 
mixing time 


1 Introduction 

Background. Markov chain Monte Carlo (MCMC) is a powerful technique for designing randomized approximation algorithms for #P-hard problems. Jerrum et al. showed the equivalence, in the sense of polynomial-time computation, between almost uniform generation and randomized approximate counting for self-reducible problems. A number of fully polynomial-time randomized approximation schemes (FPRAS) based on their technique have been developed for #P-hard problems, such as the volume of a convex body [14, 25, 11], the integral of a log-concave function [25], the partition function of the Ising model, and counting bipartite matchings. When designing an FPRAS based on the technique, it is important that the total variation distance of the approximate distribution from the target distribution is sufficiently small, and hence analyses of the mixing times of Markov chains are central issues in a series of works on MCMC for FPRAS, in order to guarantee a small total variation distance. See also Section 2.1 for the terminology of Markov chains.

In contrast, not many results are known about deterministic approximation algorithms for #P-hard problems. A remarkable progress is the correlation decay technique, independently devised by Weitz and by Bandyopadhyay and Gamarnik [5], and there are several recent developments on the technique. For counting 0-1 knapsack solutions, Gopalan et al. and Štefankovič et al. [29] gave deterministic approximation algorithms (see also [17]). Ando and Kijima [2] gave an FPTAS based on approximate convolutions for computing the volume of a 0-1 knapsack polytope. A direct derandomization of MCMC algorithms is not known yet, but it holds potential as a general scheme for designing deterministic approximation algorithms for #P-hard problems. Deterministic random walks [10, 9, 13, 7, 23, 22, 27] may be used as a substitute for Markov chains for this purpose.

Deterministic random walk. A deterministic random walk is a deterministic process analogous to a (multiple) random walk¹. A configuration $\chi^{(t)} \in \mathbb{Z}_{\ge 0}^{V}$ of $M$ tokens distributed over a (finite) vertex set $V$ is deterministically updated from time $t$ to $t+1$ by routers equipped on vertices. The router on a vertex $u \in V$ deterministically serves tokens on $u$ to each neighboring vertex $v$ with a ratio (about) $P_{u,v} \in [0,1]$ such that $\sum_{v \in V} P_{u,v} = 1$, i.e., $P = (P_{u,v}) \in \mathbb{R}_{\ge 0}^{V \times V}$ is a transition matrix (when $V$ is finite). See Section 2.2 for the detailed description of the model with which this paper is concerned. Note that the expected configuration $\mu^{(t)} \in \mathbb{R}_{\ge 0}^{V}$ of $M$ tokens in a multiple random walk at time $t$ is given by $\mu^{(t)} = \mu^{(0)} P^t$ on the assumption that $\mu^{(0)} = \chi^{(0)}$.

Cooper and Spencer [10] investigated the rotor-router model, which is a deterministic random walk corresponding to a simple random walk, and showed for the $d$-dimensional (infinite) integer lattice that the maximum vertex-wise discrepancy $\|\chi^{(t)} - \mu^{(t)}\|_\infty$ is upper bounded by a constant $c_d$, which depends only on $d$ but is independent of the total number of tokens. Later, it was shown that $c_1 \simeq 2.29$ [9] and that $c_2$ is about 7.29 or 7.83 depending on the routers [13]. On the other hand, Cooper et al. [7] gave an example of a rotor-router on the infinite $k$-regular tree such that its vertex-wise discrepancy gets $\Omega(\sqrt{kt})$ for an arbitrarily fixed $t$.

Motivated by general transition matrices, Kijima et al. [23] investigated a rotor-router model on finite multidigraphs, and gave a bound $O(n|\mathcal{A}|)$ of the vertex-wise discrepancy when $P$ is rational, ergodic and reversible, where $n = |V|$ and $\mathcal{A}$ denotes the set of multiple edges. For an arbitrary rational transition matrix $P$, Kajino et al. [22] gave an upper bound using the second largest eigenvalue $\lambda^*$ of $P$ and some other parameters of $P$. To deal with irrational transition probabilities, Shiraga et al. [27] presented a generalized notion of the rotor-router model, which they call the functional router model. They gave a bound $O((\pi_{\max}/\pi_{\min}) t^* \Delta)$ of the vertex-wise discrepancy for a specific functional router model (namely, the SRT-router model) when $P$ is ergodic and reversible, where $t^*$ denotes the mixing rate of $P$ and $\pi_{\max}$ (resp. $\pi_{\min}$) is the maximum (resp. minimum) element of the stationary distribution vector $\pi$ of $P$. Using [27], Shiraga et al. [28] discussed the time complexity of a simulation, in which they are concerned with an oblivious version, meaning that the states of routers are reset in each step, while the deterministic random walk mentioned above carries over the states of routers to the next step.

Similar, or essentially the same, concepts have been independently developed in several fields, such as load balancing, information spreading and self-organization. Rabani et al. [26] investigated the diffusive model for load balancing, which is an oblivious version of deterministic random walk, and showed for the model that the vertex-wise discrepancy is $O(\Delta \log(n)/(1 - \lambda^*))$ when $P$ is symmetric and ergodic, where $\Delta$ is the maximum degree of the transition diagram of $P$. Friedrich et al. proposed the BED algorithm for load balancing, which uses some extra information from the previous time step, and they gave $O(d^{1.5})$ for the hypercube and $O(1)$ for constant dimensional tori. Akbari et al. discussed the relation between the BED algorithm and the rotor-router model, and gave the same bounds for a rotor-router model. Berenbrink et al. [6] investigated cumulatively fair balancers algorithms, which include the rotor-router model, and gave an upper bound $O(d \min(\sqrt{\log(n)/(1 - \lambda^*)}, \sqrt{n}))$ for a lazy version of simple random walks on $d$-regular graphs.

As a closely related topic, the behavior of the rotor-router model with a single token has also been investigated. Holroyd and Propp investigated the frequency of visits of the token to each vertex in $t$ steps, and showed that its $L_\infty$ distance from $\pi$, after normalization, is $O(mn/t)$. Preceding that work, Yanovski et al. showed that the

¹ A “multiple random walk” means independent random walks of many tokens.
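Conditions on $P$                  | $L_\infty$-discrepancy                                              | $L_1$-discrepancy
-----------------------------------+---------------------------------------------------------------------+--------------------------------------------------------------
E. R., symmetric                   | $O(\Delta \log(n)/(1-\lambda^*))$ [26]                              | $O(\Delta n \log(n)/(1-\lambda^*))$
E. R. L., rational                 | $O(n|\mathcal{A}|)$ [23]                                            | $O(n^2|\mathcal{A}|)$
any rational                       | $O(\alpha^* n|\mathcal{A}| \cdots)$ [22]                            |
E. R.                              | $O((\pi_{\max}/\pi_{\min}) t^* \Delta)$ [27]                        | $O((\pi_{\max}/\pi_{\min}) t^* \Delta n)$
E. R. L., simple r.w., $d$-regular | $O(d \min(\sqrt{\log(n)/(1-\lambda^*)}, \sqrt{n}))$ [6]             | $O(m \min(\sqrt{\log(n)/(1-\lambda^*)}, \sqrt{n}))$
E.                                 |                                                                     | $O(mt^*)$ (Thm. 3.2)
E. L.                              |                                                                     | $O(m\sqrt{t^* \log t^*})$ (Thm. 4.2)
E. R. L., symmetric                | $O(\Delta\sqrt{t^* \log t^*})$ (Thm. 4.7)                           |

E.: ergodic, R.: reversible, L.: lazy

Table 1: Summary of known results on $\|\chi^{(T)} - \mu^{(T)}\|_\infty$ for finite graphs, and this work.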


rotor-router model with a single token always stabilizes to a traversal of an Eulerian cycle after $2mD$ steps at most, where $D$ denotes the diameter of the graph. This result implies that the (edge) cover time of the rotor-router model with a single token is $O(mD)$ for any graph. Bampas et al. [4] gave examples in which the stabilization time gets $\Omega(mD)$. Similar analyses for the rotor-router model with many tokens have been developed recently. Dereniowski et al. investigated the cover time of the rotor-router model with $M$ tokens, and gave an upper bound $O(mD/\log M)$ and an example of $\Omega(mD/M)$ as a lower bound. Chalopin et al. gave an upper bound $O(m^4 D^2 + mD \log M)$ on its stabilization time, while they also showed that the period of a cyclic stabilized state can get as large as $2^{\Omega(\sqrt{n})}$.

Our results. As we stated before, the total variation distance between the target distribution and approximate samples is significant in the analysis of MCMC algorithms. While there are several works on deterministic random walks concerning the vertex-wise discrepancy $\|\chi^{(t)} - \mu^{(t)}\|_\infty$, such as [26, 23, 22, 27, 6], little is known about the total variation discrepancy $\|\chi^{(t)} - \mu^{(t)}\|_1$. This paper investigates the total variation discrepancy to develop a new analysis technique aiming at derandomizing MCMC.

To begin with, we give a simple but nontrivial upper bound for any ergodic finite Markov chain; precisely, we show $\|\chi^{(t)} - \mu^{(t)}\|_1 = O(mt^*)$, where $t^*$ is the mixing rate of $P$ and $m$ is the number of edges of the transition diagram of $P$. In fact, the analyses are almost the same for both the non-oblivious model, including the rotor-router model [10, 23, 22], and the oblivious model like [26, 28], in which the states of routers are reset in each step, and in Section 3 we deal with the oblivious model. We also give a lower bound for the oblivious model, presenting an example such that $\|\chi^{(t)} - \mu^{(t)}\|_1 = \Omega(nt^*)$, which suggests that the mixing rate is not negligible in the $L_1$ discrepancy for the oblivious model.

Then, in Section 4 we give a better upper bound for the non-oblivious deterministic random walk; precisely, we show $\|\chi^{(t)} - \mu^{(t)}\|_1 = O(m\sqrt{t^* \log t^*})$ when $P$ is ergodic and lazy. Notice that the upper bound does not require reversibility. The analysis technique is a modification of Berenbrink et al. [6], in which they investigated a lazy version of simple random walks on $d$-regular graphs. In fact, we also remark that the analysis technique of [6] for the vertex-wise discrepancy extends to general graphs; precisely, we show that $\|\chi^{(t)} - \mu^{(t)}\|_\infty = O(\Delta\sqrt{t^* \log t^*})$ when $P$ is ergodic, lazy and symmetric. We also present some lower bounds of the $L_1$ discrepancy for non-oblivious models.

Table 1 shows a summary of known results [26, 23, 22, 27, 6] on $\|\chi^{(T)} - \mu^{(T)}\|_\infty$, and the results of this work. The column “$L_1$-discrepancy” shows the upper bounds of $\|\chi^{(T)} - \mu^{(T)}\|_1$ implied by the previous results [26, 23, 22, 27, 6], in comparison with the upper bounds obtained in this paper.


2 Preliminaries 

2.1 Random walk / Markov chain 

As a preliminary step, we introduce some terminology of Markov chains (cf. [24]). Let $V = \{1, \ldots, n\}$ be a finite set, and let $P \in \mathbb{R}_{\ge 0}^{n \times n}$ be a transition matrix on $V$, which satisfies $\sum_{v \in V} P_{u,v} = 1$ for any $u \in V$, where $P_{u,v}$ denotes the $(u,v)$ entry of $P$ ($P^t_{u,v}$ denotes the $(u,v)$ entry of $P^t$, as well). Let $\mathcal{G} = (V, \mathcal{E})$ be the transition diagram of $P$, meaning that $\mathcal{E} = \{(u,v) \in V \times V \mid P_{u,v} > 0\}$. Let $\mathcal{N}^+(v)$ and $\mathcal{N}^-(v)$ respectively denote the out-neighborhood and the in-neighborhood of $v \in V$ on $\mathcal{G}$². For convenience, let $m = |\mathcal{E}|$, $\delta^+(v) = |\mathcal{N}^+(v)|$ and $\delta^-(v) = |\mathcal{N}^-(v)|$.

A finite Markov chain is called ergodic if $P$ is irreducible³ and aperiodic⁴. It is well known that any ergodic $P$ has a unique stationary distribution $\pi \in \mathbb{R}_{\ge 0}^{n}$ (i.e., $\pi P = \pi$), and the limit distribution is $\pi$ (i.e., $\lim_{t \to \infty} \xi P^t = \pi$ for any probability distribution $\xi$ on $V$). Let $\xi$ and $\zeta$ be probability distributions on $V$; then the total variation distance $\mathcal{D}_{\mathrm{tv}}$ between $\xi$ and $\zeta$ is defined by


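$$\mathcal{D}_{\mathrm{tv}}(\xi, \zeta) \stackrel{\mathrm{def.}}{=} \max_{A \subseteq V} \left| \sum_{v \in A} (\xi_v - \zeta_v) \right| = \frac{1}{2} \|\xi - \zeta\|_1. \tag{1}$$

The mixing time of $P$ is defined by

$$\tau(\epsilon) \stackrel{\mathrm{def.}}{=} \max_{v \in V} \min\left\{ t \in \mathbb{Z}_{\ge 0} \mid \mathcal{D}_{\mathrm{tv}}(P^t_{v,\cdot}, \pi) \le \epsilon \right\} \tag{2}$$

for any $\epsilon > 0$⁵. Let $t^* \stackrel{\mathrm{def.}}{=} \tau(1/4)$, called the mixing rate, which is often used as a characterization of $P$.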

Let $\mu^{(0)} = (\mu^{(0)}_1, \ldots, \mu^{(0)}_n) \in \mathbb{Z}_{\ge 0}^{n}$ denote an initial configuration of $M$ tokens over $V$. Suppose that each token randomly and independently moves according to $P$. Let $\mu^{(t)}$ denote the expected configuration of tokens at time $t \in \mathbb{Z}_{\ge 0}$ in a Markov chain; then $\mu^{(t)} = \mu^{(0)} P^t$ holds. By the definition of the mixing time, $\mathcal{D}_{\mathrm{tv}}(\mu^{(t)}/M, \pi) \le \epsilon$ holds for any $t \ge \tau(\epsilon)$ if $P$ is ergodic.
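The definitions of this subsection can be made concrete with a small computation. The following is a minimal sketch, not part of the original analysis: it assumes a hypothetical example chain (the lazy simple random walk on a 4-cycle), uses exact rational arithmetic, and the helper names `mat_vec`, `tv_distance` and `mixing_time` are illustrative only.

```python
from fractions import Fraction

def mat_vec(xi, P):
    """Row vector times matrix: (xi P)_u = sum_v xi_v * P[v][u]."""
    n = len(P)
    return [sum(xi[v] * P[v][u] for v in range(n)) for u in range(n)]

def tv_distance(xi, zeta):
    """Total variation distance, computed as half of the L1 distance."""
    return sum(abs(a - b) for a, b in zip(xi, zeta)) / 2

def mixing_time(P, pi, eps, t_max=10_000):
    """tau(eps): over all start vertices v, the first t with
    D_tv(P^t_{v,.}, pi) <= eps, maximised over v."""
    n = len(P)
    worst = 0
    for v in range(n):
        row = [Fraction(int(u == v)) for u in range(n)]  # P^0_{v,.}
        t = 0
        while tv_distance(row, pi) > eps:
            row = mat_vec(row, P)
            t += 1
            if t > t_max:
                raise RuntimeError("chain did not mix within t_max steps")
        worst = max(worst, t)
    return worst

# Hypothetical example: lazy simple random walk on a 4-cycle.
n = 4
P = [[Fraction(0)] * n for _ in range(n)]
for v in range(n):
    P[v][v] = Fraction(1, 2)
    P[v][(v + 1) % n] = Fraction(1, 4)
    P[v][(v - 1) % n] = Fraction(1, 4)
pi = [Fraction(1, n)] * n            # uniform stationary distribution
t_star = mixing_time(P, pi, Fraction(1, 4))

mu = [8, 0, 0, 0]                    # mu^(0): M = 8 tokens on vertex 0
for _ in range(t_star):
    mu = mat_vec(mu, P)              # mu^(t) = mu^(0) P^t
```

After `t_star` steps the normalized expected configuration `mu / M` is within total variation distance 1/4 of `pi`, as the definition of the mixing rate requires.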


2.2 Deterministic random walk: framework 

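A deterministic random walk is a deterministic process imitating $\mu^{(t)}$. Let $\chi^{(0)} = \mu^{(0)}$, and let $\chi^{(t)} \in \mathbb{Z}_{\ge 0}^{n}$ denote the configuration of tokens at time $t \in \mathbb{Z}_{\ge 0}$ in a deterministic random walk. An update in a deterministic random walk is defined by $Z^{(t)}_{v,u}$ denoting the number of tokens moving from $v$ to $u$ at time $t$, where $Z^{(t)}_{v,u}$ must satisfy the condition that

$$\sum_{u \in \mathcal{N}^+(v)} Z^{(t)}_{v,u} = \chi^{(t)}_v \tag{3}$$

for any $v \in V$. Then, $\chi^{(t+1)}$ is defined by

$$\chi^{(t+1)}_u \stackrel{\mathrm{def.}}{=} \sum_{v \in \mathcal{N}^-(u)} Z^{(t)}_{v,u} \tag{4}$$

for any $u \in V$. We will explain some specific deterministic random walks in Sections 3.1 and 4.1 by giving precise definitions of $Z^{(t)}_{v,u}$. We are interested in the question of whether $\chi^{(t)}$ approximates $\mu^{(t)}$ well in terms of the total variation discrepancy, i.e., the question is how large $\max_{A \subseteq V} \left| \sum_{v \in A} (\chi^{(t)}_v - \mu^{(t)}_v) \right|$ does get.

At the end of this section, we introduce two notations which we will use in the paper. For any $\xi \in \mathbb{R}^{n}$ and $A \subseteq V$, let $\xi_A$ denote $\sum_{v \in A} \xi_v$. For example, $\mu^{(t)}_A = \sum_{v \in A} \mu^{(t)}_v$ and $P^t_{u,A} = \sum_{v \in A} P^t_{u,v}$. For any $\xi \in \mathbb{R}^{n}$, $P \in \mathbb{R}^{n \times n}$ and $u \in V$, let $(\xi P)_u$ denote the $u$-th element of the vector $\xi P$, i.e., $(\xi P)_u = \sum_{v \in V} \xi_v P_{v,u}$.

² $\mathcal{N}^+(v) = \{u \in V \mid P_{v,u} > 0\}$ and $\mathcal{N}^-(v) = \{u \in V \mid P_{u,v} > 0\}$.

³ $P$ is irreducible if $\forall u, v \in V$, $\exists t > 0$, $P^t_{u,v} > 0$. Then, the transition diagram of $P$ is connected.

⁴ $P$ is aperiodic if $\forall v \in V$, $\mathrm{GCD}\{t \in \mathbb{Z}_{>0} \mid P^t_{v,v} > 0\} = 1$.

⁵ $P^t_{u,\cdot}$ denotes the $u$-th row vector of $P^t$.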


3 Upper and lower bounds for oblivious model 

This section is concerned with an oblivious version of deterministic random walk, which is closely related to the models in [26, 28].


3.1 Oblivious model 

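Given a transition matrix $P$ and a configuration $\chi^{(t)}$ of tokens, we define $Z^{(t)}_{v,u}$ as follows. Assume that an arbitrary ordering $u_1, \ldots, u_{\delta^+(v)}$ on $\mathcal{N}^+(v)$ is prescribed for each $v \in V$. Then, let

$$Z^{(t)}_{v,u_i} = \begin{cases} \lfloor \chi^{(t)}_v P_{v,u_i} \rfloor + 1 & (i \le i^*) \\ \lfloor \chi^{(t)}_v P_{v,u_i} \rfloor & (\text{otherwise}), \end{cases} \tag{5}$$

where $i^* \stackrel{\mathrm{def.}}{=} \chi^{(t)}_v - \sum_{i=1}^{\delta^+(v)} \lfloor \chi^{(t)}_v P_{v,u_i} \rfloor$ denotes the number of “surplus” tokens. It is easy to check that the condition (3) holds for any $v \in V$ and $t \in \mathbb{Z}_{\ge 0}$. Then, the configuration $\chi^{(t+1)}$ is updated according to (4), recursively. The following observation is easy from the definition (5) of $Z^{(t)}_{v,u}$.

Observation 3.1. For any oblivious model, $|Z^{(t)}_{v,u} - \chi^{(t)}_v P_{v,u}| \le 1$ holds for any $u, v \in V$ and $t \in \mathbb{Z}_{\ge 0}$.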
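The update rule (5) can be sketched in a few lines of code. The following is an illustrative implementation under stated assumptions, not the authors' code: the function name `oblivious_step`, the argument `order` (holding the prescribed ordering of out-neighbors) and the example chain (a lazy walk on a 4-cycle) are all hypothetical choices.

```python
from fractions import Fraction
from math import floor

def oblivious_step(chi, P, order):
    """One update chi^(t) -> chi^(t+1) of the oblivious model, following (5).

    order[v] lists the out-neighbours u_1, u_2, ... of v in the prescribed order.
    Returns (new configuration, Z), where Z[v][u] tokens move from v to u.
    """
    n = len(chi)
    Z = [[0] * n for _ in range(n)]
    for v in range(n):
        base = {u: floor(chi[v] * P[v][u]) for u in order[v]}
        surplus = chi[v] - sum(base.values())      # i* in the text
        for i, u in enumerate(order[v]):
            # The first i* neighbours in the ordering receive one extra token.
            Z[v][u] = base[u] + (1 if i < surplus else 0)
    new_chi = [sum(Z[v][u] for v in range(n)) for u in range(n)]   # rule (4)
    return new_chi, Z

# Hypothetical example: lazy simple random walk on a 4-cycle.
n = 4
P = [[Fraction(0)] * n for _ in range(n)]
for v in range(n):
    P[v][v] = Fraction(1, 2)
    P[v][(v + 1) % n] = Fraction(1, 4)
    P[v][(v - 1) % n] = Fraction(1, 4)
order = [[v, (v + 1) % n, (v - 1) % n] for v in range(n)]
chi = [7, 0, 3, 1]
new_chi, Z = oblivious_step(chi, P, order)
```

Because the router keeps no state between steps, the same function is applied at every time step; condition (3) and Observation 3.1 can be checked directly on the returned `Z`.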


3.2 Upper bound 

In this section, we give an upper bound of the total variation discrepancy.

Theorem 3.2. Suppose $P \in \mathbb{R}_{\ge 0}^{n \times n}$ is ergodic. Then, for any oblivious model,

$$\left| \chi^{(T)}_A - \mu^{(T)}_A \right| \le \frac{3}{2} m t^*$$

holds for any $A \subseteq V$ and for any $T \in \mathbb{Z}_{\ge 0}$.

Remark that Theorem 3.2 only assumes that $P$ is ergodic.

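Proof of Theorem 3.2. Let $\phi^{(t)} = \chi^{(t+1)} - \chi^{(t)} P$ for convenience. By (4) and Observation 3.1,

$$\left| \phi^{(t)}_u \right| = \left| \chi^{(t+1)}_u - (\chi^{(t)} P)_u \right| = \left| \sum_{v \in \mathcal{N}^-(u)} \left( Z^{(t)}_{v,u} - \chi^{(t)}_v P_{v,u} \right) \right| \le \sum_{v \in \mathcal{N}^-(u)} \left| Z^{(t)}_{v,u} - \chi^{(t)}_v P_{v,u} \right| \le \delta^-(u) \tag{6}$$

holds for any $u \in V$ and $t \in \mathbb{Z}_{\ge 0}$. Now, we see that

$$\sum_{t=0}^{T-1} \phi^{(T-t-1)} P^t = \sum_{t=0}^{T-1} \left( \chi^{(T-t)} P^t - \chi^{(T-t-1)} P^{t+1} \right) = \chi^{(T)} P^0 - \chi^{(0)} P^T = \chi^{(T)} - \mu^{(T)} \tag{7}$$

holds, since $\mu^{(T)} = \chi^{(0)} P^T$ holds by the assumption. By (7),

$$\chi^{(T)}_A - \mu^{(T)}_A = \left( \sum_{t=0}^{T-1} \phi^{(T-t-1)} P^t \right)_A = \sum_{t=0}^{T-1} \sum_{u \in V} \phi^{(T-t-1)}_u P^t_{u,A} = \sum_{t=0}^{\alpha t^*-1} \sum_{u \in V} \phi^{(T-t-1)}_u P^t_{u,A} + \sum_{t=\alpha t^*}^{T-1} \sum_{u \in V} \phi^{(T-t-1)}_u \left( P^t_{u,A} - \pi_A \right)$$

for any possible integer $\alpha$, where the last equality follows from the fact that

$$\sum_{u \in V} \phi^{(t)}_u = \sum_{u \in V} \left( \chi^{(t+1)}_u - (\chi^{(t)} P)_u \right) = \sum_{u \in V} \chi^{(t+1)}_u - \sum_{u \in V} \sum_{v \in V} \chi^{(t)}_v P_{v,u} = M - M = 0 \tag{8}$$

holds for any $t \in \mathbb{Z}_{\ge 0}$. By (8), we obtain that

$$\left| \chi^{(T)}_A - \mu^{(T)}_A \right| \le \left| \sum_{t=0}^{\alpha t^*-1} \sum_{u \in V} \phi^{(T-t-1)}_u P^t_{u,A} \right| + \left| \sum_{t=\alpha t^*}^{T-1} \sum_{u \in V} \phi^{(T-t-1)}_u \left( P^t_{u,A} - \pi_A \right) \right|. \tag{9}$$

Now, we give upper bounds of each term of (9). For the first term of (9), it is easy to see that

$$\left| \sum_{t=0}^{\alpha t^*-1} \sum_{u \in V} \phi^{(T-t-1)}_u P^t_{u,A} \right| \le \sum_{t=0}^{\alpha t^*-1} \sum_{u \in V} \left| \phi^{(T-t-1)}_u \right| P^t_{u,A} \le \sum_{t=0}^{\alpha t^*-1} \sum_{u \in V} \delta^-(u) = \sum_{t=0}^{\alpha t^*-1} m = m \alpha t^* \tag{10}$$

holds by (6). To bound the second term of (9), we use the following lemma (see Appendix A.1 for the proof).

Lemma 3.3. [27] Suppose $P \in \mathbb{R}_{\ge 0}^{n \times n}$ is ergodic. Then,

$$\sum_{t=\alpha t^*}^{\infty} \mathcal{D}_{\mathrm{tv}}(P^t_{u,\cdot}, \pi) \le \frac{t^*}{2^{\alpha}}$$

holds for any $u \in V$ and for any $\alpha \in \mathbb{Z}_{>0}$.

By Lemma 3.3, we obtain that

$$\left| \sum_{t=\alpha t^*}^{T-1} \sum_{u \in V} \phi^{(T-t-1)}_u \left( P^t_{u,A} - \pi_A \right) \right| \le \sum_{t=\alpha t^*}^{T-1} \sum_{u \in V} \left| \phi^{(T-t-1)}_u \right| \left| P^t_{u,A} - \pi_A \right| \le \frac{t^*}{2^{\alpha}} \sum_{u \in V} \max_{0 \le t < T} \left| \phi^{(t)}_u \right| \le \frac{t^*}{2^{\alpha}} m \tag{11}$$

holds, where the last inequality follows from (6). Now, we obtain the claim from (9), (10) and (11) by letting $\alpha = 1$. □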
3.3 Lower bound

We give the following lower bound for an oblivious model. This proposition implies that we cannot improve the term $t^*$ for oblivious models in general.

Proposition 3.4. There exists an oblivious model such that

$$\max_{S \subseteq V} \left| \chi^{(T)}_S - \mu^{(T)}_S \right| = \Omega(nt^*)$$

holds for any time $T$ after mixing.

Proof. Let $V = \{0, \ldots, n-1\}$ with $n$ even, and let a transition matrix $P$ be defined by $P_{u,u} = (k-1)/k$ for any $u \in V$, and $P_{u,v} = 1/(k(n-1))$ for any $u, v \in V$ such that $u \ne v$, i.e., $P$ denotes a simple random walk on the complete graph $K_n$ with a self-loop probability $(k-1)/k$ for any vertex. For this $P$, it is not difficult to check $t^* = O(k)$ (see Appendix A.2). Then, we give a corresponding oblivious deterministic random walk. Let us assume that the prescribed ordering for each $v \in V$ starts with $v$ itself (remember the definition of an oblivious deterministic random walk in Section 3.1). Let

$$\chi^{(0)}_u = \begin{cases} k & (u \in A) \\ 0 & (u \in B), \end{cases}$$

where $A = \{0, \ldots, n/2 - 1\}$ and $B = \{n/2, \ldots, n-1\}$. Then, the initial configuration is stable, i.e., $\chi^{(t)} = \chi^{(0)}$, since each $v \in A$ serves $\lfloor k \cdot \frac{k-1}{k} \rfloor + 1 = k$ tokens to itself (notice that the “surplus” token stays at $v$ according to the prescribed ordering). Since $M = kn/2$ and $\pi$ is uniform, it is easy to see that

$$\max_{S \subseteq V} \left| \chi^{(t)}_S - \mu^{(t)}_S \right| \ge \left| \chi^{(t)}_A - \mu^{(t)}_A \right| \ge \frac{kn}{2} - \left( \frac{1}{2} + \epsilon \right) \frac{kn}{2} = \left( \frac{1}{2} - \epsilon \right) \frac{kn}{2} = \Omega(nt^*)$$

holds for any $t \ge \tau(\epsilon)$ with a constant $\epsilon < 1/2$. We obtain the claim. □
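The stable configuration used in the proof of Proposition 3.4 can be reproduced numerically. The following sketch is only an illustration under assumed parameters ($n = 8$, $k = 5$ are hypothetical choices, as are the function names); it restates the oblivious update of Section 3.1 with every ordering starting at the vertex itself and tracks the expected configuration with exact fractions.

```python
from fractions import Fraction
from math import floor

def lazy_complete_walk(n, k):
    """P_{u,u} = (k-1)/k and P_{u,v} = 1/(k(n-1)) for u != v."""
    off = Fraction(1, k * (n - 1))
    return [[Fraction(k - 1, k) if u == v else off for v in range(n)]
            for u in range(n)]

def oblivious_step(chi, P):
    """Oblivious update (5); the ordering of every vertex v starts with v itself."""
    n = len(chi)
    new_chi = [0] * n
    for v in range(n):
        order = [v] + [u for u in range(n) if u != v]
        base = [floor(chi[v] * P[v][u]) for u in order]
        surplus = chi[v] - sum(base)
        for i, u in enumerate(order):
            new_chi[u] += base[i] + (1 if i < surplus else 0)
    return new_chi

n, k = 8, 5                                # hypothetical sizes, n even
P = lazy_complete_walk(n, k)
chi0 = [k] * (n // 2) + [0] * (n // 2)     # k tokens on each vertex of A
chi = list(chi0)
mu = [Fraction(x) for x in chi0]
for _ in range(200):
    chi = oblivious_step(chi, P)
    mu = [sum(mu[v] * P[v][u] for v in range(n)) for u in range(n)]
gap = sum(chi[: n // 2]) - sum(mu[: n // 2])   # chi_A - mu_A
```

In this run the token configuration never moves, while the expected configuration spreads out evenly, so `gap` approaches $kn/4$, in line with the $\Omega(nt^*)$ claim for $t^* = O(k)$.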


4 Upper and lower bounds for non-oblivious model

Observation 3.1 for the oblivious model suggests only that $|Z^{(t)}_{v,u} - \chi^{(t)}_v P_{v,u}| \le 1$ holds for any $t \in \mathbb{Z}_{\ge 0}$. In this section, we introduce the SRT-router model (cf. [27]), which satisfies $\left| \sum_{t=0}^{T} (Z^{(t)}_{v,u} - \chi^{(t)}_v P_{v,u}) \right| < 1$ for any $T \in \mathbb{Z}_{\ge 0}$, and we obtain an improved bound when the Markov chain is lazy⁶.

4.1 Model 

The SRT-router model, based on the shortest remaining time (SRT) rule [3, 30, 27], is a generalized version of the rotor-router model. In the model, we define an SRT-router $\sigma_v \colon \mathbb{Z}_{\ge 0} \to \mathcal{N}^+(v)$ on each $v \in V$ for a given $P$. Roughly speaking, $\sigma_v(i)$ denotes the destination of the $i$-th launched token at $v$. Given $\sigma_v(0), \ldots, \sigma_v(i-1)$, inductively $\sigma_v(i)$ is defined as follows. First, let

$$T_i(v) = \left\{ u \in \mathcal{N}^+(v) \mid |\{ j \in [0, i) \mid \sigma_v(j) = u \}| - (i+1) P_{v,u} < 0 \right\},$$

where $[z, z') \stackrel{\mathrm{def.}}{=} \{z, z+1, \ldots, z'-1\}$ (remark $[z, z) = \emptyset$). Then, let $\sigma_v(i)$ be $u^* \in T_i(v)$ minimizing the value

$$\frac{|\{ j \in [0, i) \mid \sigma_v(j) = u \}| + 1}{P_{v,u}}$$

in any $u \in T_i(v)$. If there are two or more such $u \in T_i(v)$, then let $u^*$ be the minimum among them in an arbitrary prescribed order. The ordering $\sigma_v(0), \sigma_v(1), \ldots$ is known as the shortest remaining time (SRT) rule (see e.g., [3, 30, 27]).

⁶ $P$ is lazy if $P_{u,u} \ge 1/2$ holds for any $u \in V$.

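In an SRT-router model, there are $\chi^{(t)}_v$ tokens on a vertex $v$ at time $t$, and each vertex $v$ serves tokens on $v$ to the neighboring vertices one by one according to $\sigma_v(i)$, like a rotor-router. For example, if there are $a$ tokens on $v$ at time $t = 0$, then $|\{j \in [0, a) \mid \sigma_v(j) = u\}|$ tokens move to each $u \in \mathcal{N}^+(v)$; if there are $b$ tokens on $v$ at $t = 1$, then $|\{j \in [a, a+b) \mid \sigma_v(j) = u\}|$ tokens move to each $u \in \mathcal{N}^+(v)$, and so on. Formally, it is defined by

$$Z^{(t)}_{v,u} = \left| \left\{ j \in \left[ \sum_{s=0}^{t-1} \chi^{(s)}_v, \ \sum_{s=0}^{t} \chi^{(s)}_v \right) \mid \sigma_v(j) = u \right\} \right|. \tag{12}$$

It is clear that the definition (12) satisfies (3). Then, the configuration of tokens is recursively defined by (4). The following proposition is due to Angel et al. [3] and Tijdeman [30].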
The following proposifion is due fo Angel el al. |l3j| and Tijdeman |[30l . 


Proposition 4.1. [30, 3] For any SRT-router model,

$$\Bigl| \, |\{ j \in [0, z) \mid \sigma_v(j) = u \}| - z \cdot P_{v,u} \Bigr| < 1$$

holds for any $v, u \in V$ and for any $z > 0$.
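To see the SRT rule and the bound of Proposition 4.1 at work, the router sequence of a single vertex can be generated directly from the definition of $T_i(v)$ and the minimized ratio. This is a hedged sketch: the names `srt_sequence` and `max_prefix_deviation`, the tie-breaking by sorted neighbor label, and the example probabilities $(1/2, 1/3, 1/6)$ are assumptions made for illustration.

```python
from fractions import Fraction

def srt_sequence(probs, length):
    """First `length` destinations sigma_v(0), sigma_v(1), ... of an SRT-router.

    probs maps each out-neighbour u of v to P_{v,u} (exact Fractions summing to 1).
    Ties are broken by the sorted order of the neighbours, which is one arbitrary
    choice of the prescribed order mentioned in the text.
    """
    count = {u: 0 for u in probs}          # |{j in [0, i) : sigma_v(j) = u}|
    sigma = []
    for i in range(length):
        # T_i(v): neighbours whose count is still below the quota (i+1) * P_{v,u}.
        eligible = [u for u in sorted(probs) if count[u] - (i + 1) * probs[u] < 0]
        # Choose the eligible neighbour minimising (count + 1) / P_{v,u}.
        u_star = min(eligible, key=lambda u: (count[u] + 1) / probs[u])
        count[u_star] += 1
        sigma.append(u_star)
    return sigma

def max_prefix_deviation(sigma, probs):
    """Maximum over z > 0 and u of | #{j < z : sigma(j) = u} - z * P_{v,u} |."""
    count = {u: 0 for u in probs}
    worst = Fraction(0)
    for z, u in enumerate(sigma, start=1):
        count[u] += 1
        worst = max(worst, max(abs(count[w] - z * probs[w]) for w in probs))
    return worst

# Hypothetical vertex with three out-neighbours.
probs = {0: Fraction(1, 2), 1: Fraction(1, 3), 2: Fraction(1, 6)}
sigma = srt_sequence(probs, 60)
deviation = max_prefix_deviation(sigma, probs)
```

For this example the sequence is periodic with period 6 and the largest prefix deviation stays strictly below 1, as Proposition 4.1 states.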

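Proposition 4.1 suggests that $|Z^{(t)}_{v,u} - \chi^{(t)}_v P_{v,u}|$ is small enough. In fact, Proposition 4.1 and (12) suggest a stronger fact that

$$\left| \sum_{t=a}^{b} \left( Z^{(t)}_{v,u} - \chi^{(t)}_v P_{v,u} \right) \right| = \left| \, \left| \left\{ j \in \left[ \sum_{s=0}^{a-1} \chi^{(s)}_v, \ \sum_{s=0}^{b} \chi^{(s)}_v \right) \mid \sigma_v(j) = u \right\} \right| - \sum_{t=a}^{b} \chi^{(t)}_v P_{v,u} \right| \le \max_{\substack{z, z' \in \mathbb{Z}_{\ge 0} \\ \text{s.t. } z' > z}} \Bigl| \, |\{ j \in [z, z') \mid \sigma_v(j) = u \}| - (z' - z) P_{v,u} \Bigr| < 2 \tag{13}$$

holds for any $a, b \in \mathbb{Z}_{\ge 0}$ s.t. $a \le b$. We will use (13) in our analysis in Section 4.2.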


4.2 Better upper bound for the SRT-router model 

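Now, we show the following theorem for ergodic and lazy $P$, modifying the technique of [6].

Theorem 4.2. Suppose $P \in \mathbb{R}_{\ge 0}^{n \times n}$ is ergodic and lazy. Then for any SRT-router model,

$$\left| \chi^{(T)}_A - \mu^{(T)}_A \right| = O\left( m \sqrt{t^* \log t^*} \right)$$

holds for any $A \subseteq V$ and for any $T \in \mathbb{Z}_{\ge 0}$.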

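Proof of Theorem 4.2. Let $\phi^{(t)} = \chi^{(t+1)} - \chi^{(t)} P$, as in the proof of Theorem 3.2. The major difference between an oblivious model and an SRT-router model is that

$$\left| \sum_{t=a}^{b} \phi^{(t)}_u \right| = \left| \sum_{t=a}^{b} \sum_{v \in \mathcal{N}^-(u)} \left( Z^{(t)}_{v,u} - \chi^{(t)}_v P_{v,u} \right) \right| \le \sum_{v \in \mathcal{N}^-(u)} \left| \sum_{t=a}^{b} \left( Z^{(t)}_{v,u} - \chi^{(t)}_v P_{v,u} \right) \right| < 2\delta^-(u) \tag{14}$$

holds for any $u \in V$ and $b \ge a$ in an SRT-router model since (13) holds. It is easy to check that $|\chi^{(T)}_A - \mu^{(T)}_A| < 3mt^*$ holds for any SRT-router model by the same argument as in the proof of Theorem 3.2, using (14) instead of (6). Thus we obtain $|\chi^{(T)}_A - \mu^{(T)}_A| \le 6m$ if $t^* = 1, 2$. In the rest of the proof, we assume that $t^* \ge 3$, which implies $t^* \lceil \lg t^* \rceil \ge 3$. We introduce the following proposition and lemma to give a better upper bound of the first term of (9). See Appendices A.3 and A.4 for the proofs.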

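Proposition 4.3. Let $F_t = \sum_{i=0}^{t} f_i$. Then

$$\sum_{t=0}^{T} f_t g_t = F_T g_T + \sum_{t=0}^{T-1} F_t (g_t - g_{t+1})$$

holds for any $T \in \mathbb{Z}_{\ge 0}$ and for any $f_t, g_t$ ($0 \le t \le T$).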
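The summation-by-parts identity of Proposition 4.3 is easy to check numerically. The sketch below is a sanity check only, on arbitrary hypothetical sequences `f` and `g`; the function names `lhs` and `rhs` are illustrative.

```python
from fractions import Fraction
from itertools import accumulate

def lhs(f, g):
    """sum_{t=0}^{T} f_t g_t."""
    return sum(ft * gt for ft, gt in zip(f, g))

def rhs(f, g):
    """F_T g_T + sum_{t=0}^{T-1} F_t (g_t - g_{t+1}) with F_t = f_0 + ... + f_t."""
    F = list(accumulate(f))
    T = len(f) - 1
    return F[T] * g[T] + sum(F[t] * (g[t] - g[t + 1]) for t in range(T))

# Arbitrary test sequences; exact fractions avoid rounding effects.
f = [3, -1, 4, -1, -5, 9]
g = [Fraction(1, t + 1) for t in range(len(f))]
```

In the proof, the identity is applied with $f_t = \phi^{(T-t-1)}_u$ and $g_t = P^t_{u,A}$, so that partial sums of $\phi$ (controlled by (14)) are paired with one-step differences of $P^t_{u,A}$.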

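Lemma 4.4. Suppose that $P \in \mathbb{R}_{\ge 0}^{n \times n}$ is ergodic and lazy. Then,

$$\sum_{t=0}^{T} \mathcal{D}_{\mathrm{tv}}\left( P^t_{u,\cdot}, P^{t+1}_{u,\cdot} \right) \le 24\sqrt{T} - 11$$

holds for any $u \in V$ and for any $T \in \mathbb{Z}_{>0}$.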

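Using Proposition 4.3, (14) and Lemma 4.4, we obtain

$$\left| \sum_{t=0}^{\alpha t^*-1} \phi^{(T-t-1)}_u P^t_{u,A} \right| = \left| \left( \sum_{i=0}^{\alpha t^*-1} \phi^{(T-i-1)}_u \right) P^{\alpha t^*-1}_{u,A} + \sum_{t=0}^{\alpha t^*-2} \left( \sum_{i=0}^{t} \phi^{(T-i-1)}_u \right) \left( P^t_{u,A} - P^{t+1}_{u,A} \right) \right|$$

$$\le \left| \sum_{i=0}^{\alpha t^*-1} \phi^{(T-i-1)}_u \right| P^{\alpha t^*-1}_{u,A} + \sum_{t=0}^{\alpha t^*-2} \left| \sum_{i=0}^{t} \phi^{(T-i-1)}_u \right| \left| P^t_{u,A} - P^{t+1}_{u,A} \right| \tag{15}$$

$$\le 2\delta^-(u) + 2\delta^-(u) \cdot \left( 24\sqrt{\alpha t^* - 2} - 11 \right) = 2\delta^-(u) \left( 24\sqrt{\alpha t^* - 2} - 10 \right) \tag{16}$$

for any $u \in V$, where $\alpha$ is an arbitrary positive integer satisfying $\alpha t^* \ge 3$. Finally, (9), (16), (11) and (14) imply that

$$\left| \chi^{(T)}_A - \mu^{(T)}_A \right| \le \sum_{u \in V} \left| \sum_{t=0}^{\alpha t^*-1} \phi^{(T-t-1)}_u P^t_{u,A} \right| + \frac{t^*}{2^{\alpha}} \sum_{u \in V} \max_{0 \le t < T} \left| \phi^{(t)}_u \right| \le 2m \left( 24\sqrt{\alpha t^* - 2} - 10 \right) + 2m \frac{t^*}{2^{\alpha}} \le 2m \left( 24\sqrt{t^* \lceil \lg t^* \rceil - 2} - 9 \right),$$

where the last inequality is obtained by letting $\alpha = \lceil \lg t^* \rceil$. We obtain the claim. □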


4.3 Lower bounds 


This section discusses lower bounds of the total variation discrepancy. First, we observe the following proposition, which is caused by the integral gap between $\chi^{(T)} \in \mathbb{Z}_{\ge 0}^{n}$ and $\mu^{(T)} \in \mathbb{R}_{\ge 0}^{n}$.

Proposition 4.5. Suppose that $P$ is ergodic and its stationary distribution is uniform. Then, for any $\chi^{(0)} \in \mathbb{Z}_{\ge 0}^{n}$ with an appropriate number of tokens $M$,

$$\max_{S \subseteq V} \left| \chi^{(T)}_S - \mu^{(T)}_S \right| = \Omega(n)$$

holds for any time $T$ after mixing.
We also give a better lower bound for an SRT-router model.

Proposition 4.6. There exists an example of an SRT-router model such that

$$\max_{S \subseteq V} \left| \chi^{(T)}_S - \mu^{(T)}_S \right| \ge \frac{n^2}{8}$$

holds for any $T > 0$.

See Appendices A.5 and A.6 for the proofs.


4.4 Vertex-wise discrepancy 

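This section presents an upper bound of the single vertex discrepancy $\|\chi^{(T)} - \mu^{(T)}\|_\infty$, which is an extended version of [6] to ergodic, reversible and lazy Markov chains in general.

Theorem 4.7. Suppose $P \in \mathbb{R}_{\ge 0}^{n \times n}$ is ergodic, reversible⁷ and lazy. Then for any SRT-router model,

$$\left| \chi^{(T)}_w - \mu^{(T)}_w \right| = O\left( \frac{\pi_{\max}}{\pi_{\min}} \Delta \sqrt{t^* \log t^*} \right)$$

holds for any $w \in V$ and for any $T \in \mathbb{Z}_{\ge 0}$, where $\Delta = \max_{u \in V} |\mathcal{N}^+(u)|$ $(= \max_{u \in V} |\mathcal{N}^-(u)|)$, $\pi_{\max} = \max_{v \in V} \pi_v$ and $\pi_{\min} = \min_{v \in V} \pi_v$.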

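Proof. If $t^* = 1, 2$, then $|\chi^{(T)}_w - \mu^{(T)}_w| = O\left( \frac{\pi_{\max}}{\pi_{\min}} \Delta \right)$ holds since $|\chi^{(T)}_w - \mu^{(T)}_w| = O\left( \frac{\pi_{\max}}{\pi_{\min}} t^* \Delta \right)$ holds due to [27]. Now, we assume $t^* \ge 3$, which implies $t^* \lceil \lg t^* \rceil \ge 3$. By a combination of (9), (15), (11) and (14), we obtain that

$$\left| \chi^{(T)}_w - \mu^{(T)}_w \right| \le 2\Delta \sum_{u \in V} P^{\alpha t^*-1}_{u,w} + 2\Delta \sum_{t=0}^{\alpha t^*-2} \sum_{u \in V} \left| P^t_{u,w} - P^{t+1}_{u,w} \right| + 2\Delta \sum_{t=\alpha t^*}^{T-1} \sum_{u \in V} \left| P^t_{u,w} - \pi_w \right| \tag{17}$$

holds, where $\alpha$ is an arbitrary positive integer satisfying $\alpha t^* \ge 3$. The condition that $P$ is reversible, i.e., $\pi_u P^t_{u,w} = \pi_w P^t_{w,u}$ holds for any $u, w \in V$, implies that

$$\sum_{u \in V} P^t_{u,w} = \sum_{u \in V} \frac{\pi_w}{\pi_u} P^t_{w,u} \le \frac{\pi_{\max}}{\pi_{\min}} \sum_{u \in V} P^t_{w,u} = \frac{\pi_{\max}}{\pi_{\min}} \tag{18}$$

holds. Lemma 4.4 implies that

$$\sum_{t=0}^{\alpha t^*-2} \sum_{u \in V} \left| P^t_{u,w} - P^{t+1}_{u,w} \right| = \sum_{t=0}^{\alpha t^*-2} \sum_{u \in V} \frac{\pi_w}{\pi_u} \left| P^t_{w,u} - P^{t+1}_{w,u} \right| \le \frac{\pi_{\max}}{\pi_{\min}} \sum_{t=0}^{\alpha t^*-2} \sum_{u \in V} \left| P^t_{w,u} - P^{t+1}_{w,u} \right| = \frac{2\pi_{\max}}{\pi_{\min}} \sum_{t=0}^{\alpha t^*-2} \mathcal{D}_{\mathrm{tv}}\left( P^t_{w,\cdot}, P^{t+1}_{w,\cdot} \right) \le \frac{2\pi_{\max}}{\pi_{\min}} \left( 24\sqrt{\alpha t^* - 2} - 11 \right) \tag{19}$$

holds, as well as Lemma 3.3 implies that

$$\sum_{t=\alpha t^*}^{T-1} \sum_{u \in V} \left| P^t_{u,w} - \pi_w \right| = \sum_{t=\alpha t^*}^{T-1} \sum_{u \in V} \frac{\pi_w}{\pi_u} \left| P^t_{w,u} - \pi_u \right| \le \frac{\pi_{\max}}{\pi_{\min}} \sum_{t=\alpha t^*}^{T-1} \sum_{u \in V} \left| P^t_{w,u} - \pi_u \right| = \frac{2\pi_{\max}}{\pi_{\min}} \sum_{t=\alpha t^*}^{T-1} \mathcal{D}_{\mathrm{tv}}\left( P^t_{w,\cdot}, \pi \right) \le \frac{2\pi_{\max}}{\pi_{\min}} \cdot \frac{t^*}{2^{\alpha}} \tag{20}$$

holds. Thus, a combination of (17), (18), (19) and (20) implies that

$$\left| \chi^{(T)}_w - \mu^{(T)}_w \right| \le 2\Delta \frac{\pi_{\max}}{\pi_{\min}} + 2\Delta \frac{2\pi_{\max}}{\pi_{\min}} \left( 24\sqrt{\alpha t^* - 2} - 11 \right) + 2\Delta \frac{2\pi_{\max}}{\pi_{\min}} \cdot \frac{t^*}{2^{\alpha}} \le \frac{2\pi_{\max}}{\pi_{\min}} \Delta \left( 48\sqrt{t^* \lceil \lg t^* \rceil - 2} - 19 \right)$$

holds, where the last inequality follows by letting $\alpha = \lceil \lg t^* \rceil$. We obtain the claim. □

⁷ $P$ is reversible if the detailed balance equation $\pi_v P_{v,u} = \pi_u P_{u,v}$ holds for any $u, v \in V$. Notice that a reversible ergodic $P$ is symmetric if its stationary distribution is uniform, and vice versa.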


5 Concluding Remarks 

In this paper, we gave two upper bounds of the total variation discrepancy: one is $\|\chi^{(t)} - \mu^{(t)}\|_1 = O(mt^*)$ for any ergodic Markov chain, and the other is $\|\chi^{(t)} - \mu^{(t)}\|_1 = O(m\sqrt{t^* \log t^*})$ for any lazy and ergodic Markov chain. We also showed some lower bounds. Closing the gap between the upper and lower bounds is future work. Development of a deterministic approximation algorithm based on deterministic random walks for #P-hard problems is a challenge.
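A Supplemental proofs

A.1 Proof of Lemma 3.3

For convenience, let $h(t) = \max_{u \in V} \mathcal{D}_{\mathrm{tv}}(P^t_{u,\cdot}, \pi)$. We use the following proposition to obtain Lemma 3.3.

Proposition A.1. [27] For any integers $\ell$ ($\ell \ge 1$) and $k$ ($0 \le k < t^*$),

$$h(\ell t^* + k) \le \frac{1}{2^{\ell+1}}$$

holds.

Proof of Lemma 3.3. By Proposition A.1,

$$\sum_{t=\alpha t^*}^{\infty} \mathcal{D}_{\mathrm{tv}}(P^t_{u,\cdot}, \pi) \le \sum_{\ell=\alpha}^{\infty} \sum_{k=0}^{t^*-1} h(\ell t^* + k) \le \sum_{\ell=\alpha}^{\infty} \frac{t^*}{2^{\ell+1}} = \frac{t^*}{2^{\alpha}}$$

holds. We obtain the claim. □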


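A.2 Supplemental proof of Proposition 3.4

We give a proof of $t^* = O(k)$ for Proposition 3.4.

Proposition A.2. Let

$$P_{u,v} = \begin{cases} \frac{k-1}{k} & (u = v) \\ \frac{1}{k(n-1)} & (\text{otherwise}). \end{cases}$$

Then $t^* = O(k)$.

Proof. The proof is based on the coupling technique [24]. Let $X_t$ be a Markov chain according to $P$, and let $Y_t$ be another Markov chain with the same transition matrix $P$, where the transition from $Y_t$ to $Y_{t+1}$ depends on $X_t$ such that

$$Y_{t+1} = \begin{cases} Y_t & (\text{if } X_{t+1} = X_t) \\ X_t & (\text{if } X_{t+1} = Y_t) \\ X_{t+1} & (\text{otherwise}). \end{cases}$$

Then, it is not difficult to see that for any $X_0$ and $Y_0$,

$$\Pr[X_t \ne Y_t] \le \left( \frac{k-1}{k} + \frac{1}{k(n-1)} \right)^{t}$$

holds for any $t \in \mathbb{Z}_{\ge 0}$, thus $\mathcal{D}_{\mathrm{tv}}(P^t_{v,\cdot}, \pi) \le \left( \frac{k-1}{k} + \frac{1}{k(n-1)} \right)^{t}$ by the coupling lemma (cf. [24]). Now, we obtain that

$$\tau(\epsilon) \le \left\lceil \frac{\log \epsilon^{-1}}{\log \left( \frac{k-1}{k} + \frac{1}{k(n-1)} \right)^{-1}} \right\rceil = \left\lceil \frac{\log \epsilon^{-1}}{\log \left( 1 - \frac{n-2}{k(n-1)} \right)^{-1}} \right\rceil \le \left\lceil \frac{k(n-1)}{n-2} \log \epsilon^{-1} \right\rceil$$

holds, where we used the fact that $\log (1-x)^{-1} \ge x$ holds for any $x$ ($0 < x < 1$). We obtain the claim. □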


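A.3 Proof of Proposition 4.3

Proof. Let $F_t = \sum_{i=0}^{t} f_i$. Then, $f_t = F_t - F_{t-1}$ holds for any $t \ge 1$, and

$$\sum_{t=0}^{T} f_t g_t = f_0 g_0 + \sum_{t=1}^{T} f_t g_t = f_0 g_0 + \sum_{t=1}^{T} (F_t - F_{t-1}) g_t$$

$$= f_0 g_0 + \sum_{t=1}^{T} F_t g_t - \sum_{t=1}^{T} F_{t-1} g_t = \sum_{t=0}^{T} F_t g_t - \sum_{t=0}^{T-1} F_t g_{t+1}$$

$$= F_T g_T + \sum_{t=0}^{T-1} F_t (g_t - g_{t+1}).$$

□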


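A.4 Proof of Lemma 4.4

To bound $\mathcal{D}_{\mathrm{tv}}(P^t_{u,\cdot}, P^{t+1}_{u,\cdot})$, we use the following proposition.

Proposition A.3. [24] Suppose $P \in \mathbb{R}_{\ge 0}^{n \times n}$ is ergodic and lazy. Then

$$\mathcal{D}_{\mathrm{tv}}\left( P^t_{u,\cdot}, P^{t+1}_{u,\cdot} \right) \le \frac{12}{\sqrt{t}}$$

holds for any $u \in V$ and for any $t > 0$.

Proof of Lemma 4.4. By Proposition A.3,

$$\sum_{t=0}^{T} \mathcal{D}_{\mathrm{tv}}\left( P^t_{u,\cdot}, P^{t+1}_{u,\cdot} \right) \le 1 + \sum_{t=1}^{T} \mathcal{D}_{\mathrm{tv}}\left( P^t_{u,\cdot}, P^{t+1}_{u,\cdot} \right) \le 1 + \sum_{t=1}^{T} \frac{12}{\sqrt{t}} \le 1 + 12\left( 2\sqrt{T} - 1 \right) = 24\sqrt{T} - 11$$

holds, and we obtain the claim. Remark that we use the fact $\sum_{t=1}^{T} \frac{1}{\sqrt{t}} \le 2\sqrt{T} - 1$. □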
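The elementary inequality $\sum_{t=1}^{T} 1/\sqrt{t} \le 2\sqrt{T} - 1$ used at the end of the proof can be spot-checked numerically. This is only a floating-point sanity check over a hypothetical range of $T$, not a proof; the function name and the tolerance `1e-9` are assumptions of the sketch.

```python
from math import sqrt

def check_sqrt_sum_bound(T_max, tol=1e-9):
    """Return True if sum_{t=1}^{T} 1/sqrt(t) <= 2*sqrt(T) - 1 for all 1 <= T <= T_max,
    and consequently 1 + 12 * sum <= 24*sqrt(T) - 11 (up to a float tolerance)."""
    partial = 0.0
    for T in range(1, T_max + 1):
        partial += 1 / sqrt(T)
        if partial > 2 * sqrt(T) - 1 + tol:
            return False
        if 1 + 12 * partial > 24 * sqrt(T) - 11 + tol:
            return False
    return True

ok = check_sqrt_sum_bound(5000)
```

The inequality is tight at $T = 1$ (both sides equal 1) and follows in general from $1/\sqrt{t} \le 2(\sqrt{t} - \sqrt{t-1})$ for $t \ge 1$.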


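A.5 Proof of Proposition 4.5

Proof. Let $M = (k - 1/2)n$ be the number of tokens for an arbitrary positive integer $k$. Note that $\mu^{(t)}_v / M$ converges to $1/n$ for any $v \in V$ since the stationary distribution is uniform. Precisely, for any $A \subseteq V$ and $T \ge \tau(1/(8k))$,

$$\frac{|A|}{n} - \frac{1}{8k} \le \sum_{v \in A} \frac{\mu^{(T)}_v}{M} \le \frac{|A|}{n} + \frac{1}{8k} \tag{21}$$

holds by the definition (2) of the mixing time $\tau(\epsilon)$.

Let $T$ be an arbitrary time satisfying $T \ge \tau(1/(8k))$, and let $A = \{v \in V \mid \chi^{(T)}_v \ge k\}$. First, we consider the case that $|A| \ge n/2$. Then, we see that $\sum_{v \in A} \chi^{(T)}_v \ge k|A|$ holds. At the same time,

$$\sum_{v \in A} \mu^{(T)}_v \le \left( k - \frac{1}{2} \right) n \left( \frac{|A|}{n} + \frac{1}{8k} \right) \le \left( k - \frac{1}{2} \right) |A| + \frac{n}{8}$$

holds by (21). Thus

$$\sum_{v \in A} \left( \chi^{(T)}_v - \mu^{(T)}_v \right) \ge k|A| - \left( k - \frac{1}{2} \right) |A| - \frac{n}{8} = \frac{|A|}{2} - \frac{n}{8} \ge \frac{n}{4} - \frac{n}{8} = \frac{n}{8},$$

where the last inequality follows from $|A| \ge n/2$. We obtain the claim in this case. Next, we consider the other case, meaning that $|A| < n/2$. Let $\bar{A} = V \setminus A$. Then, we see that $\sum_{v \in \bar{A}} \chi^{(T)}_v \le (k-1)|\bar{A}|$ since $\chi^{(T)}_v < k$ for any $v \notin A$. At that time,

$$\sum_{v \in \bar{A}} \mu^{(T)}_v \ge \left( k - \frac{1}{2} \right) n \left( \frac{|\bar{A}|}{n} - \frac{1}{8k} \right) \ge \left( k - \frac{1}{2} \right) |\bar{A}| - \frac{n}{8}$$

holds by (21). Thus

$$\sum_{v \in \bar{A}} \left( \mu^{(T)}_v - \chi^{(T)}_v \right) \ge \left( k - \frac{1}{2} \right) |\bar{A}| - \frac{n}{8} - (k-1)|\bar{A}| = \frac{|\bar{A}|}{2} - \frac{n}{8} > \frac{n}{4} - \frac{n}{8} = \frac{n}{8},$$

where the last inequality follows from $|\bar{A}| > n/2$. We obtain the claim. □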


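A.6 Proof of Proposition 4.6

Proof. We consider a random walk on a complete graph $K_{2n'}$ with self-loops, i.e., let $V = \{0, 1, \ldots, 2n'-1\}$ ($n' \in \mathbb{Z}_{>0}$) and $P_{u,v} = 1/(2n')$ for any $u, v \in V$. Let $A = \{0, 1, \ldots, n'-1\}$, $B = \{n', n'+1, \ldots, 2n'-1\}$, and let

$$\chi^{(0)}_u = \begin{cases} (2k+1)n' & (u \in A) \\ 0 & (u \in B), \end{cases}$$

for an arbitrary $k \in \mathbb{Z}_{\ge 0}$. Note that $M = \|\chi^{(0)}\|_1 = (2k+1)(n')^2$. Since this $P$ mixes in a single step, $\mu^{(t)}_A = \mu^{(t)}_B = (2k+1)(n')^2/2$ holds for any $t > 0$. We define the SRT-router $\sigma_u(i)$ as

$$\sigma_u(i) = i \bmod 2n'$$

for any $u \in V$. Then, it is not difficult to check that, for any $t > 0$, one of $\chi^{(t)}_A$ and $\chi^{(t)}_B$ is equal to $(k+1)(n')^2$ and the other is equal to $k(n')^2$: every vertex holds a multiple of $n'$ tokens, and the $n'$ tokens that a vertex serves beyond its full rounds all go to $A$ or all go to $B$. Thus,

$$\max_{S \subseteq V} \left| \chi^{(t)}_S - \mu^{(t)}_S \right| \ge \left| \chi^{(t)}_A - \mu^{(t)}_A \right| = \frac{(n')^2}{2} = \frac{n^2}{8}$$

holds for any $t > 0$, where $n = |V| = 2n'$. We obtain the claim.

□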
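The example in the proof above is small enough to simulate. The sketch below is illustrative only: the function names and the sample parameters are hypothetical, and the router is implemented directly as $\sigma_u(i) = i \bmod 2n'$ with the pointer carried over between steps, as in the non-oblivious model.

```python
def rotor_step(chi, pointer):
    """One step of the router sigma_u(i) = i mod n on the complete graph with
    self-loops: vertex u sends its chi[u] tokens to pointer[u], pointer[u]+1, ...
    cyclically, and the pointer is carried over to the next step."""
    n = len(chi)
    new_chi = [0] * n
    for u in range(n):
        full, rest = divmod(chi[u], n)
        for w in range(n):
            new_chi[w] += full                 # full rounds reach every vertex
        for j in range(rest):
            new_chi[(pointer[u] + j) % n] += 1
        pointer[u] = (pointer[u] + rest) % n
    return new_chi

def discrepancies(n_half, k, steps):
    """|chi_A^(t) - mu_A^(t)| for t = 1, ..., steps in the example of Proposition 4.6."""
    n = 2 * n_half
    chi = [(2 * k + 1) * n_half] * n_half + [0] * n_half
    pointer = [0] * n
    M = sum(chi)
    out = []
    for _ in range(steps):
        chi = rotor_step(chi, pointer)
        chi_A = sum(chi[:n_half])
        out.append(abs(chi_A - M / 2))         # mu_A^(t) = M/2 for every t > 0
    return out

gaps_even_k = discrepancies(3, 2, 20)          # n' = 3, k = 2
gaps_odd_k = discrepancies(2, 3, 20)           # n' = 2, k = 3
```

In both runs the discrepancy equals $(n')^2/2$ at every step $t > 0$, matching the bound $n^2/8$ with $n = 2n'$.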